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Abstract 


The genetic diversity of 2 pairs (AH65 and AH187) of wild type bovine coronaviruses (BCoV) sequenced directly from nasal (respiratory) 
and rectal (enteric) swabs of two feedlot calves with respiratory and enteric symptoms [Hasoksuz, M., Sreevatsan, S., Cho, K.O., Hoet, A.E., 
Saif, L.J., 2002b. Molecular analysis of the S1 subunit of the spike glycoprotein of respiratory and enteric bovine coronavirus isolates. Virus 
Res. 84 (1-2), 101—109.]. was analyzed. Sequence analysis of the complete genomes revealed differences at 123 and 149 nucleotides (nt) 
throughout the entire genome between the respiratory and enteric strains for samples AH65 and AH187, respectively, indicating the presence of 
intra-host BCoV quasispecies. In addition, significant numbers of sequence ambiguities were found in the genomes of some BCoV-R and BCoV- 
E strains, suggesting intra-isolate quasispecies. The tissue culture (TC) passaged counterparts of AH65 respiratory BCoV (AH65-R-TC) and 
enteric BCoV (AH65-E-TC) were also sequenced after 14 and 15 passages and 1| plaque purification in human rectal tumor cells (HRT-18), 
respectively. Compared to the parental wild type strains, tissue culture passage generated 104 nt changes in the AH65-E-TC isolate but only 8 nt 
changes in the AH65-R-TC isolate. Particularly noteworthy, the majority of nucleotide changes in the AH65-E-TC isolate occurred at the 
identical positions as the mutations occurring in the AH65-R strain from the same animal. These data suggest that BCoV evolves through 
quasispecies development, and that enteric BCoV isolates are more prone to genetic changes and may mutate to resemble respiratory BCoV 
strains after tissue culture passage. 
© 2007 Elsevier Inc. All rights reserved. 
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Coronaviruses (CoVs) are enveloped viruses with single- 
stranded RNA genomes of positive polarity. They belong to the 
Coronavirus genus in the Coronaviridae family of the order 
Nidovirales (Masters, 2006). The CoVs belong to three different 
groups based on antigenic and genetic properties. Bovine 
coronavirus is a member of group 2a whereas Severe Acute 
Respiratory Syndrome (SARS) CoVs comprise group 2b. The 
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genome of bovine CoVs typically contains 11 ORFs encoding 
11 structural and non-structural proteins including a polymerase 
la and 1b polyprotein, a 32 kDa non-structural protein (NSP), 
hemagglutinin esterase (HE), spike protein, 4.9, 4.8, and 
12.7 kDa NSP, E, M, and N proteins (Cavanagh et al., 1990). 

Bovine coronaviruses were first recognized as enteric 
pathogens (BCoV-E), that are generally associated with 
neonatal calf diarrhea (NCD) and winter dysentery (WD) in 
adult cattle (Benfield and Saif, 1990; Mebus et al., 1973a, 
1973b; Saif, 1990; Tsunemitsu and Saif, 1995). Besides their 
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enteric pathogenicity, BCoVs were also identified as respiratory 
pathogens in cattle. The association of respiratory BCoV 
(BCoV-R) with calf pneumonia was first reported. Then later 
BCoV-R was identified in association with shipping fever of 
feedlot cattle (Cho et al., 2001b; Hasoksuz et al., 2002a; 
Lathrop et al., 2000; Storz et al., 2000a, 2000b). It is suggested 
that the shipping of cattle is a stress factor, and like weaning and 
dietary changes, it is a predisposing condition for initiating 
cattle respiratory disease. Under either natural or experimental 
conditions, nasal and rectal shedding of BCoV is frequently 
detected concurrently from infected calves (Cho et al., 2001a, 
2001b; Reynolds et al., 1985). 

To examine the genetic diversity between BCoV-E and 
BCoV-R strains and the intra-host viral evolution of BCoV, viral 
RNA was extracted directly from 2 pairs (AH65 and AH187) of 
nasal and rectal swab fluids of two BCoV-infected calves with 
overt respiratory and enteric symptoms, from the same feedlot 
but collected during different years (Hasoksuz et al., 2002b). 
Samples AH187 and AH65 were collected from feedlot calves 
in 2000 and 2001, respectively from the Ohio Agricultural 
Research and Development Center (OARDC) feedlot. Both 
calves were 5—7 months of age. The complete genomic 
sequences of the two BCoV-E/BCoV-R pairs (AH65-R, 
AH65-E, AH187-R, and AH187-E) were determined. The 
tissue culture-passaged counterparts of AH65-R and AH65-E 
(AH65-R-TC and AH65-E-TC) were also sequenced to define 
the genomic changes resulting from in vitro adaptation and 
passage. The AH65-R-TC was derived from the original field 
AH65-R isolate after an initial 6 passages and 1 plaque 
purification step followed by an additional 8 passages in HRT- 
18 cells. The AH65-E-TC was derived from the AH65-E isolate 
after 8 passages and | plaque purification step followed by an 
additional 7 passages in HRT-18 cells. RNA was extracted from 
the tissue cultured viruses after centrifugation (2800xg for 
15 min at 4 °C) to remove infected cells. For the nasal and rectal 
swabs, samples were subjected to centrifugation (2800xg for 
15 min at 4 °C) and ultrafiltration (0.22 um membranes) before 
RNA extraction. Specific oligonucleotide primers were 
designed using a calf diarrhea DB2 strain of enteric BCoV 
(BCoV-E-DB2) genome in GenBank (accession DQ811784). 
Primers were designed at every 500 bp along the genome. 
Oligonucleotide primers incorporating an M13 sequence tag 
(forward primer: TGTAAAACGACGGCCAGT; reverse pri- 
mer: CAGGAAACAGCTATGACC) were used for sequencing. 
Primer sequences are included in Supplementary Table 1. 
Amplicons prepared for sequencing were generated by reverse 
transcription and polymerase chain reactions (RT-PCR) as 
described previously (Zhang et al., 2007). Each amplicon was 
sequenced from each end using M13 forward and reverse 
primers listed above. Sequencing reactions were analyzed on a 
3730 ABI sequencer (Applied Biosystems, CA, USA). 
Sequencing reads were downloaded, trimmed to remove 
amplicon primer-linker sequences as well as low quality 
sequence and assembled using TIGR Assembler (www.tigr. 
org/software/assembler/). Strain specific primers were designed 
for RT-PCR to close gaps between assembled contigs. 
Additional primer design, cDNA synthesis, and sequencing 


were performed to ensure greater than 4x sequence coverage 
along the coronavirus genomes. Assemblies were manually 
edited using CloE (Closure Editor), a TIGR program for editing 
assemblies. All apparent polymorphisms were checked against 
reference data and ambiguities were exhaustively analyzed by 
RT-PCR and cloning. The final genome assemblies have been 
deposited in GenBank. The GenBank accession numbers are as 
follows: AH65-E: EF424615; AH65-E-TC: EF424616; AH65-R: 
EF424617; AH65-R-TC: EF424618; AH187-E: EF424619; 
AH187-R: EF424620. 

The genomic lengths of the BCoV strains in nt are 31,017, 
30,970, 31,016, 30,995, 30,995, and 30,935 for AH65-E, 
AH65-E-TC, AH65-R, AH65-R-TC, AH187-E, and AH187-R, 
respectively. The 6 genomic sequences were comparable in size, 
and had no deletions and insertions. The genomes were aligned 
using ClustalW in DNAstar and ends were trimmed to remove 
primer sequences and obtain uniform length after alignment. 
The 6 genomes are uniformly 30,894 nt in length after ends 
trimming and are typical of BCoV genomes containing 5’ and 3’ 
ends, structural and non-structural protein coding sequences 
arranged as follows: 5’-UTR (nt 1-204), ORFla (nt 205— 
13,356), ORF 1b (nt 13,335—21,488), 32 kDa NSP (nt 21,498— 
22,334), HE (nt 22,346—23,620), S (nt 23,635—27,726), 
4.9 kDa NSP (nt 27,716—27,805), 4.8 kDa NSP (nt 27,883-— 
28,020), 12.7 kDa NSP (nt 28,100—28,429), E (nt 28,416— 
28,670), M (nt 28,685—29,377), N (nt 29,387—30,733), and 3’- 
UTR (nt 30,734—30,894). The 6 genomes share over 99.5% 
sequence identity and together they share over 99.2% nucleo- 
tide similarity with the BCoV-E-DB2 strain (accession 
DQ811784). The reference BCoV-E-DB2 strain was isolated 
from a diarrheic neonatal calf in the OARDC dairy in 1995 
(Tsunemitsu and Saif, 1995). The high sequence homology and 
the fact that no deletions and insertions were found in any of the 
6 genomes suggest that the 6 BCoVs were derived from a 
similar DB2-like BCoV-E strain that has been circulating in the 
geographically closely related calf herds. 


Intra-host quasispecies 


The enteric AH65-E BCoV differed from the respiratory 
counterpart AH65-R at 123 nucleotide positions (Table 1), of 
which 34 mutations were non-synonymous changes in 
structural and non-structural proteins. Likewise, the enteric 
AH187-E and its respiratory counterpart AH187-R viruses were 
distinguished by 149 nucleotide differences (Table 2), of which 
76 of the 149 nucleotide mutations were non-synonymous. Pair- 
wise comparison among the 6 genomes was performed and 
divergent nucleotide positions are summarized in Table 3. The 
calculated number of nucleotide and non-synonymous muta- 
tions included the ambiguous nucleotides and amino acids. 
Nucleotide symbols beyond A, T, G, and C represent ambiguity 
nucleotide positions: R=A or G, Y=C or T, M=A or C, K=G 
or T, W=A or T. Only | common mutation was found in the two 
pairs at either nucleotide or amino acid level (nt position 27947 
resulting in a common T to I mutation from BCoV-E to BCoV-R 
in the 4.8 kDa NSP). The fact that significant numbers of 
nucleotide polymorphisms were found to exist in respiratory 
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Table 1 


Nucleotide differences between AH65-E and AH65-R and mutations in AH65-E-TC and AH65-R-TC after tissue culture passage* 


9 % Fs ce 9 : 3 % 9 %) 
Pe ce - , & PE - . & 
mo me a mo me me oe a en 
8 S$ 8 $s 8S $8 8 S$ S$ 8 $8 SS 8 $8 8S S$ 8 $ 
2 jem) xc x = 2 x= ££ x =f x 2 qc ft zte 2 c x 3) om 
#¢ <= < <= = -¢ #+ £242 ¢ #4 4-24 4 # =< <= <= #2 =< <= <= 
5’ UTR nt1-204 ORF la gene continued ORF Ib gene nt13335-21488 S gene nt23635-27726 4.9 KDNSP gene nt27716-27805 
3 C C T T 7638 G G K T [14513 G A A A]|{[23664 A C C C][27746 A G G G 
ORF la gene nt205-13356 7767 G G R R_ 1464 T C T T 4/2374 C G G G||277998 Cc T T T 
1137 G A A Aj] 7947 T T G G_ 1464722 T C T T 12376 T C C C} {27812 T C C_ C 
T] 7971 T T Y C 15011 T C T TT 423835 T C C C| 4.8KkDNSP gene nt27883-28020 
fis A or | 23971 A G G G]/[27936 T C C C¢ 
390 C C T T 24039 C T T TI}|279943 C Tr T T 
9402 T T C c [16193 T C C C] {2416 T C c cC||279947 C Tr T T 
16198 T T G G_ {24237 T C Cc c| [2803 T C C Cc 
1022063 G G A A [16505 T G G G|]| {2424 T C Cc Cl} 2808 G G T T 
10692 T T C C 16589 T C T T (2460 T C C C] 280897 C T C C 
10726 C T C C 1778 C T C C_  }24786 T C C C| 12.7kKDNSP gene nt28100-28429 
171022 GA G_G_ {25228 G A A A| [28382 A GGG 
T T Y T G T T T]|/25296 C TF T T M gene nt28685-29377 
G GR R c T T TI 425305 C T T TI{ 28855 T C C C 
T T Y Y c T T TY 4/2534 T C c cli29110 T C Cc Cc 
cccy T C Cc CI }253599 C T T TI _29215 T T CC 
Cc C¢ Tf £ c T T TI 4/2552 T CCe N gene nt29387-30733 
G GGebR G A A A] |25884 A C C T/f29680 A G GG 
C-€ -c- 7 T G G G||2124 Cc T T Cl] 29702 A C C C 
A G G G|}2621I5 T C c T|{29823 T A A A 
AT Te.t c Tr T TY [26718 Cc 7T T Cl 3018 T T C C 
c Cc T T A G G Gl] 2791 C A C C [3016 G AAA 
c c T T T G_G_G Cc T T Tl 305335 G G A A 
A A TT  32KkDNSP gene nt21498-22334 T A A A| 30543 C C T T 
c c T T (21999 AC C C c T T TI] 3039 T T C C 
C C- A A HE gene nt22346-23620 A GGG 
A A C C [22559 G T T c T TT 3’ UTR nt30734-30894 
22600 iT cc A C Cc C| 3093 T T T G 
22656 A G G A T T T 
22826 C c Cc T AAA 
23242 CC T_T T A A A 


23384 «iT 


A T T 


“The four genomic sequences were aligned with ClustalW program in Lasergene software (DNASTAR Inc). Uniform length of 30,894 nt was achieved after 
trimming sequence ends. PPositions for nucleotide changes in individual genes are shown. ‘Nucleotides are in bold italics and boxed when AH65-E mutated to 
nucleotides identical to AH65-R after tissue culture passage. “The ambiguous nucleotides are in bold italics and underlined. 


and enteric BCoV strains from the same animals indicates the 
presence of an intra-host quasispecies. These mutations were 
scattered throughout the entire genome, but the spike gene had 
higher mutation number to gene length ratios relative to most of 
the other gene regions (mutation n=31 in spike gene out of total 
123 nt differences for AH65-E/AH65-R pair and n=30 out of 
total 149 nt differences for AH187-E/AH187-R pair). This 
suggests a higher selective pressure on the spike gene and 
supports evidence from other CoVs on the role of this protein in 
determining host and tissue tropism (Sanchez et al., 1992, 1999; 
Schickli et al., 2004; Thackray and Holmes, 2004). 


Intra-isolate quasispecies 
Additionally the genomes of the uncloned wild type AH65-R 


and AH187-E BCoVs contained a number of sequence 
ambiguities, i.e. nucleotide positions which could not be 


resolved by manual analysis of sequence traces or exhaustive 
analysis by RT-PCR and cloning. Irresolvable sequence 
ambiguities in otherwise clean sequence data can represent 
the presence of multiple RNA species present in a sample. The 
sequence generated during this study was derived from RT-PCR 
and thus represents a description of a population of viruses 
present in the strain at the time of purification. Sequence 
ambiguities were found at 7 and 84 nucleotide positions in the 
genomes of AH65-R and AH187-E, respectively, suggesting the 
existence of intra-isolate quasispecies of BCoV in the same 
animal. 

The BCoV intra-host quasispecies can develop through 
pathways other than genomic RNA mutations introduced during 
viral replication in the host. For example, exposure to and the 
mixing of respiratory and enteric BCoV strains under field 
conditions and within a host may represent one source for 
mixtures of different viral populations. The possibility also 
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Table 2 Table 2 (continued) 
Nucleotide differences between the AH187-E and AH187-R strains* nt? AHI87-F° AHI87-R 
at” Soe AHISTR. ORF Ip weno ntl3939-21488 
ORF 1a gene nt205—13356 14672 Y C 
274 G T 14744 T C 
576 Ww T 15020 A G 
TAT Y Cc 15095 T C 
1342 G Cc 15170 ¥ G 
1474 G A 15285 Y C 
2074 C E 15513 Ww Tr 
2224 R G 16193 Y s 
2778 R G 16505 K G 
3112 Y Cc 16814 F G 
3723 Y 6} 16847 TF Cc 
3836 K G 16925 4 s 
3843 At Cc 17018 g Tr 
3948 Ps Tf 17882 R G 
3993 Y G 18311 R G 
4215 T ¢ 18479 W ar 
4506 3 Cc 18515 ¥ Cc 
4749 Ai Cc 19478 Cc T 
4961 G A 19556 G A 
5487 K ue 19631 T C 
5541 Y Cc 19643 A iC 
5559 C T 20208 G Er 
5721 G TE 20687 R G 
6465 G A 20795 Y T 
6476 C A 20828 R G 
6672 Ay iC. 20927 K G 
W317 lt Cc 21236 G A 
7854 Y T 
7947 K G 32 kDa NSP gene nt21498—22334 
7977 Y T 21563 T iG 
8223 T A 21662 Y T 
8281 M A 21719 r iT 
8436 a: C 21842 Y £ 
8575 G A 21962 Y GC 
8746 Y G 21999 M c 
8982 C uh 22025 M A 
9402 4 e 
9633 Cc T HE gene nt22346—23620 
9948 Y T 22559 K G 
10263 R A 22600 Y T 
10365 A G 23056 T iG 
10673 Y Cc 23444 Cc al 
10860 C T 23446 Cc a 
11196 Y T 23494 a C 
11223 T Cc 
11259 Y Cc S gene nt23635—27726 
11280 Y Cc 23704 G Cc 
11308 C T 23835 Y T 
11439 R G 23890 Tr C 
11844 C T 24264 Y iC 
12558 Ww A 24426 Y G 
12609 Y C 24690 Cc T 
12729 R G 25130 G A 
12861 A C 25135 F C 
12996 A G 25160 A CG 
13017 C T 25163 C G 
13091 G A 25167 Tr Cc 
25296 ¥ Cc 
ORF1b gene nt13335—21488 25305 Y C 
13418 Y F 25359 Y C 
13631 R A 25484 Y C 
14429 Y T 25862 K TE 
14513 R A 28595 G A 
14651 Ww T 26215 Cc ir 
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Table 2 (continued) 


> 
aa 
oo 
te 
mY 


nt? AH187-R 


S gene nt23635—27726 
26718 
27030 
27138 
27156 
27173 
27204 
27219 
27220 
27360 
27531 
27663 
27720 


|EIN| SIM > GINIRIN|SININ 
>arPHasastoaAastsAo 


4.9 kDa NSP gene nt27716—27805 
27746 
27812 
27842 


QIN 
4 


4.8 kDa NSP gene nt27883—28020 
27893 
27947 
28044 
28073 
28078 


Als QaQ0 
aarsas 


M gene nt28685—29377 
28716 
28855 
29110 


ae 
QQ YS 


N gene nt29387-30733 
29532 
29680 
29702 
29823 
30166 
30718 


IMPs] S|EIRO 
O> Pager 


3’ UTR nt30734-30894 


30767 Cc 


[sq 


* The two genomic sequences were aligned with ClustalW program in Lasergene 
software (DNASTAR Inc). Uniform length of 30894 nt was achieved after 
trimming sequence ends. Positions for nucleotide changes in individual genes 
are shown. “The ambiguous nucleotides are in bold italics and underlined. 


exists that the calves swallowed respiratory secretions and 
hence respiratory viruses were introduced orally into the 
intestinal tract. Likewise, calves’ nostrils are also possibly 
contaminated with enteric viruses by direct contact and 
inhalation of feces. Formation of intra-isolate viral quasispecies 
highlights the complexity of BCoV evolution under field 
conditions. The present study did not show a tendency of 
intra-isolate quasispecies formation of BCoV-E strains com- 
pared to BCoV-R strains. Although the AH187-E strain had 
more ambiguities than the AH65-R strain (84 nt vs. 7 nt), 
neither the AH65-E nor the AH187-R had any ambiguities. Our 
quasispecies data cannot solely be explained by potential 
physical contamination between respiratory and enteric sample 
materials because | and 6 nt ambiguities were also found in the 
AH65-E-TC and AH65-R-TC isolates, respectively after tissue 


Table 3 


Pair-wise comparison of divergent nucleotide positions among the six BCoVs"* 


nt” 


AHO65-E AH65-E-TC AH65-R AH65-R-TC AH187-E AH187-R 


5’ UTR ntl—-204 


a3 


ORF 1a gene nt205—13356 


274 

576 

TAT 

1137 
1209 
1334 
1342 
1474 
1596 
2074 
2086 
2224 
2703 
2778 
2928 
3112 
3251 
3480 
3540 
3723 
3836 
3843 
3948 
3993 
4215 
4506 
4749 
4776 
4791 
4926 
4961 
5097 
5112 
5310 
5346 
5487 
5541 
5559 
5624 
5721 
6303 
6387 
6465 
6476 
6508 
6672 
6741 
6917 
7023 
7179 
7227 
7317 
7638 
7767 
7854 
7947 
7971 
7977 
8223 


Cc 


AAA HAHA QOMDAA NP HAHAHA APTA ANAMQANANAAQAAAANHAAANHA AH AHPAONTFOHFONH OHH PHP PANAAPAAAAA 


C 


FAHAHAHHAQHAHAPH PHAN ASFHAHAOHAHAAPFPANAAAANSPHEyHAAAHAHANNAHAAQAANAAANAHAAPHTPAHA 


AHHK DAA AASFHAANAMDSFHAANASFAAAAAAASPANANAAAAHHAHAHAHAHANQIANANANAAAArPAAAAAH?F AHA 


AHAQDAAR A HAPHQNANPHAADAPANDAHAAAADAPFPANANAAANQPHAAAAHANQIANANAGQAAArPAHAQNAAHPAADA 


[o! 


C 


AKAARKCMNOHAONFOSFAANMOAAAQAMOKAPH HA AQININONAKAK KAA KAA OK NAA APAHAAMDSP ADK AO 
FHAQHAANMNANF OP AO-P PHA AHAAOHA AA PH HAH PaAMAaAaAAANANHNAANaAAAAANAAANAaAPAAFrAPrPAAANANA 


(continued on next page) 
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Table 3 (continued) Table 3 (continued) 
nt? AH65-E AH65-E-TC AH65-R AH65-R-TC AH187-E AH187-R nt? AH65-E AH65-E-TC AH65-R AH65-R-TC AH187-E AH187-R 


ORF 1a gene nt205—13356 ORF'1b gene nt13335—21488 
8281 C A A A M A 17882 G T T I R G 
8436 C ‘6 C Cc T Cc 17912 C T T Tr C Cc 
8575 G G G G G A 18311 G G G G R G 
8746 T T T T Y Cc 18374 C T T T C Cc 
8910 C @ T T Cc Cc 18479 T iE T T W T 
8982 C Cc C Cc Cc T 18515 T Cc Cc CG Y Cc 
9402. T T C Cc ¥ Cc 18692 C T T T C Cc 
9633 C C C C Cc T 19478 C Cc Cc C Cc T 
9948 =C T T T Y¥ T 19556 A A A A G A 
10263 G G A A R A 19631 T T T T T Cc 
10365 A A A A A G 19643 C Cc Cc C A Cc 
10692. T T C Cc T T 19691 G A A A G G 
10726 C T Cc Cc c Cc 20208 T G G G G T 
10860 C T T T Cc T 20321 A G G G A A 
10987. T T Y T T T 20475 C T T T C Cc 
11155 G G R R G G 20687 G G G G R G 
11196 T T ¥ Y Y T 20795 T ale T T Bg T 
11223 C Cc Cc Cc T Cc 20828 A G G G R G 
11259 ¢ C C Y T Cc 20927 T G G G K G 
11280 C Cc T T Y C 21236 G G G G G A 
11282 G G G R G G 
11308 T T T T Cc T 32 kDa NSP gene nt21498—22334 
11322 C ‘6 Cc ¥: Cc Cc 21563 T T T Tr T Cc 
11439 A G G G R G 21662 T T a Y T 
11844 T T T T Cc T 21719 T ale T ny Y: ot 
11853 A T T T T T 21842 T T T T Y T 
12105 C GC T T Cc Cc 21962 C Cc Cc Cc Y Cc 
12138 C Cc T T Cc Cc 21999 A Cc Cc Cc M Cc 
12558 A A T T W A 22025 A A A A M A 
12609 C Cc T T Y C 
12729 G G G G R G HE gene nt22346—23620 
12861 C G A A A Cc 22559 G T T T K G 
12996 G G G G A G 22600 T Cc C Cc Bg T 
13017 C 6: Cc Cc Cc T 22656 A A G G A A 
13091 A A A A G A 22826 C A Cc Cc Cc Cc 
13125 A A C Cc A A 23056 T T T Tr T Cc 
23137 C os T T Tr T 
ORF'1b gene nt13335—21488 23242 C T T bu T T 
13418 T T T T Y T 23384 T A iD T E T 
13631 A A A A R A 23444 C Cc C C Cc T 
14429 T T T T Y T 23446 C Cc Cc Cc Cc T 
14513 G A A A R A 23494 T T T T rT Cc 
14648 T CG T Tr a T 
14651 T T T T W T S gene nt23635—27726 
14672. T C T T Y Cc 23664 A Cc Cc Cc A A 
14744 T T T T at € 23704 C G G G G Cc 
15011 T Cc T T T T 23706 T Cc Cc C T rE 
15020 G G G G A G 23835 T Cc C C Y rE 
15095. T T T T Ty Cc 23890 T T T Tr T Cc 
15170 T T T Tt T G 23971 A G G G A A 
15285 T Cc Cc 1@ 4 € 24039 C ae T Te Cc G 
15513, A T T T W T 24165 T Cc Cc C T T 
16193. T CG Cc C Y Cc 24237 T Cc Cc Cc Tr al 
16198 T T G G T T 24264 T Cc C Cc Y: Cc 
16505 T G G G K G 24426 C Cc @ Cc ¥ Cc 
16589 T C T T Cc Cc 24690 T Cc Cc Cc Cc T 
16814. T T T T T G 24786 T Cc C C T T 
16847. T T T Tr a Cc 25130 G G G G G A 
16925 T T T T Y Cc 25135. T T T T T C 
17018 C Cc GC Cc Cc T 25160 A A A A A Cc 
17078 C T C C Cc Cc 25163 C Cc C C Cc G 
17102 G A G G G G 25167 T Te T r Tr G 


Table 3 (continued) 
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Table 3 (continued) 


nt? 


AH65-E AH65-E-TC AH65-R AH65-R-TC AH187-E AH187-R 


nt? AHO65-E AH65-E-TC AH65-R AH65-R-TC AH187-E AH187-R 


S gene nt23635—27726 


25225 
25296 
25305 
25345 
25359 
25484 
25578 
25862 
25884 
25895 
26124 
26215 
26718 
26791 
27030 
27138 
27156 
27173 
27204 
27219 
27220 
27360 
27399 
27531 
27585 
27663 
27684 
27720 


PHOArPPHPHFAOPAaAHAAaANAaNHAAAPrAHWOaAAaAAHvAAaAN.NA 


rFrHArPHAHAPH HOH eH PH anaHWaacaanaHaosAdts 


rPrPAPHAAPHAHAHPHOAAANHAANANANANAOASHAS 


4.9 kDa NSP gene nt27716—27805 


27746 
27798 
27812 
27842 


A 


Cc 
T 
G 


G 
T 


Cc 
G 


G 
T 


C 
G 


4.8 kDa NSP gene nt27883—28020 


27893 
27936 
27943 
27947" 
28044 
28073 
28078 
28087 


12.7 kDa NSP gene nt28100-28429 


28382 


M gene nt28685—29377 


28716 
28855 
29110 
29215 


N gene nt29387-30733 


29532 
29680 
29702 
29823 
30118 
30166 
30535 
30543 


aAAgAAAINaQHAA 


A 


A 
ah 
T 
al 


QAQAQAAAPPOA 


a0 


AHAAANAINA 


G 


A 


Cc 
Cc 
T 


gn 


AQAPrPHAPO 


QHaAaAINANO 


G 


ANNALS 


Arraraand 


>S>PHPHAHrFHHOAHeFHNNHAHANANANNHASNSsH:S 


aaQgnAaA 


QHAaQAIN ANON 


Qa 


AANA S 


aAPrraraand 


ZoKnrearKerdKAKeaKAKNNQMAANAAKKAKKO 


AKA 


QAARAKAINDQSHOA 


Q 


daKKH 


aaradgeFa 


PaHAHPrPrPAFANHoOaHFPrFPHaNAaNHANPrPAaAHAANAHAAaANDM 


AHOaPY> 


QAQAXPINADASA 


Q 


Haa>D> 


QAaArPAPIAAGA 


N gene nt29387-30733 


30639 T T C Gc T T 
30718 T Cc C Cc ¥ C 
3’ UTR nt30734-30894 

30767 C Cc C Cc Y Cc 
30893 T rT. T G Tr i 


“The six genomic sequences were aligned with ClustalW program in Lasergene 
software (DNASTAR Inc.). Uniform length of 30894 nt was achieved after 
trimming sequence ends. Positions for nucleotide changes in individual genes 
are shown. “The common mutation from enteric to respiratory strains at 
nucleotide position 27947 is in bold italics and underlined. 


culture passage. The single ambiguity at nucleotide position 
2086 of the AH65-E-TC apparently occurred after tissue culture 
adaptation and plaque purification (Table 1). Similarly, ambig- 
uous positions at nucleotides 11,259, 11,282, and 11,322 of 
AH65-R-TC were observed after tissue culture passage and 
plaque purification. The intra-host BCoV genetic diversities and 
the intra-isolate ambiguities discovered in the tissue culture 
BCoV isolates indicate that the mixtures of genetically different 
viruses were formed intrinsically during viral replication within 
the host or developed after tissue culture adaptation. 


Changes after tissue culture adaptation 


The AH65-E-TC isolate showed 104 nt mutations from the 
parental AH65-E strain after 15 passages in cell culture plus 1 
plaque purification. In contrast, the genome of the AH65-R-TC 
isolate changed at only 8 nucleotide positions from the AH65-R 
strain after a similar number of cell culture passages (14 tissue 
culture passages and | plaque purification). These findings 
suggest that enteric BCoV strains are more prone to change than 
respiratory BCoV strains during cell culture adaptation and 
passage. Unexpectedly, of the 104 mutations observed in the 
AH65-E-TC, 86 mutated to be the same as the corresponding 
nucleotides of the AH65-R isolate (Table 1). As a result, the 
AH65-E-TC genetically resembled the AH65-R strain more 
than the parent AH65-E strain. Particularly noteworthy, whereas 
the spike gene of the AH65-E differed from that of the AH65-R 
at 31 nucleotide positions, there was only one nucleotide 
difference in spike genes between the AH65-E-TC isolate and 
the AH65-R strain (Table 1). This is surprising because the 
HRT-18 cells used for in vitro viral adaptation and passage are 
human rectum epithelial cells and therefore are of intestinal and 
not respiratory origin. However, the AH65-E-TC isolate still 
differed from the AH65-R strain at 48 nucleotides with 37 of 
these divergences located in ORF la and 1b genes. The latter 
substantial genomic differences argue against the AH65-E-TC 
isolate being an AH65-R variant that arose from contamination 
during the process of tissue culture adaptation and passage. The 
data from these isolates suggest that similar evolutionary 
processes may have occurred in vitro during BCoV adaptation 
to cell culture as those happening in vivo during adaptation from 
enteric to respiratory tropism. Only the AH65-E-TC and AH65- 
R-TC pair was selected initially for sequencing to see if they 
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will be any significant mutations in the genome of enteric and 
respiratory BCoV strains after tissue culture passage. Given the 
interesting trend of mutations observed in the AH65-E-TC and 
AH65-R-TC genomes, sequencing of the other enteric and 
respiratory BCoV pair (e.g. AH187-E-TC and AH187-R-TC) 
will be attempted in the future to confirm if similar mutations 
occur in AH187-E-TC that mimic the AH187-R strain 
sequence. However, differences in the number and trend of 
mutations between AH187-E/AH187-R and AH65-E/AH65-R 
strains after tissue culture passage are expected based on the 
divergence of their genomes. In addition, it is possible that 
certain BCoV strains may require more cumulative passages in 
tissue culture than others to achieve significant genetic 
mutations, tropism shifts, or changes in virulence. Nevertheless, 
our observation that BCoV-E gained genetic mutations 
resembling the sequence of BCoV-R after tissue culture passage 
is consistent with that found for other members of the 
coronavirus family. For example, transmissible gastroenteritis 
virus (TGEV) strains after 60 to 115 continuous tissue passages 
may lose their enteric tropism and gain respiratory tropism 
(Harada et al., 1969; Zhang et al., 2007). 

Until now, it has been unclear whether nasal and rectal 
isolates of BCoVs are the same virus or whether BCoV 
undergoes significant genetic changes during adaptation to 
different tissues of the host. In general, biological, antigenic, 
and genetic differences were found between BCoV-R and 
BCoV-E (Gelinas et al., 2001a, 2001b; Hasoksuz et al., 1999; 
Lin et al., 2002). However, other investigators did not find 
consistent differences in biologic properties between BCoV-R 
and BCoV-E strains (Reynolds et al., 1985; Zhang et al., 1994a). 
The inconsistent results are partially ascribed to the fact that the 
BCoV strains compared varied widely in origin or year of 
isolation; for example, comparisons between BCoV-R and 
BCoV-E strains have often been compromised when viruses 
were isolated from different animals at different times 
(Reynolds et al., 1985). To accurately define tropism determi- 
nants of BCoV-R and BCoV-E, a systematic investigation of the 
complete genome sequences of BCoV-R and BCoV-E strains 
isolated concurrently from the same animals and examined both 
prior to and after cell culture adaptation is necessary. 

Only in recent years, has the genetic diversity of BCoV 
within the same animal been studied, but either only partial 
genomes were sequenced or the field BCoV isolates were first 
adapted to cell culture before sequencing. To our knowledge, 
this is the first report on sequencing and comparison of the full 
genomes of the original wild type paired animal BCoV 
specimens that fully describe the composition and complexity 
of BCoV intra-host evolution and the first to fully describe the 
existence of BCoV quasispecies. Chouljenko et al. (2001) 
sequenced a paired respiratory (LUN strain) and enteric (ENT 
strain) BCoV from the same animal with fatal shipping fever 
pneumonia and they found 107 nucleotide variations through- 
out the genomes between these two viruses. However, in that 
study the original BCoV samples were subjected to multiple cell 
culture passages and plaque purifications before sequencing. 
Such manipulations have the potential to cause genetic changes 
in BCoV genomes as evidenced in our study. In addition, no 


nucleotide ambiguities in the genomes were reported in that 
study possibly because the BCoVs were plaque purified 
multiple times and were sequenced immediately after plaque 
purification. 

Previously, the S1 subunit of the spike gene of the tissue 
culture passaged and plaque purified AH65-E and AH65-R (8 
passages for AH65-E and 6 passages for AH65-R) and 
AH187-E and AH187-R (2 passages for AH187-E and 3 
passages for AH187-R) pairs were analyzed in our lab 
(Hasoksuz et al., 2002b). In that study, the S1 protein was 
found almost identical to each other between the concurrent 
enteric and respiratory strains except for one and two amino 
acid differences identified for the AH65-E/AH65-R pair and 
the AH187-E/AH187-R pair, respectively. Those findings are 
consistent with the observations in the present study that the 
S gene sequence of the AH65-E strain after tissue culture 
adaptation and passage closely resembled that of the AH65-R 
strain. However, discrepancies in amino acid residues at 
certain positions were found in the SI region of previous 
sequences and present sequences and no nucleotide ambiguity 
positions were observed in that region in the previous study. 
The sequence discrepancy and lack of ambiguity positions 
likely reflect the fact that the viruses in our previous study 
were at different cell culture passage levels and were 
sequenced immediately after the plaque purification. 

It is widely speculated that the spike protein of CoV is the 
major determinant for tropism. The spike gene of TGEV was 
identified as the determinant for enteric tropism (Sanchez et 
al., 1999). Mutations at two nucleotides, nt positions 214 and 
655, in the spike gene of TGEV can potentially cause a shift 
from enteric to respiratory tropism (Ballesteros et al., 1997). 
We did not observe mutations at nt positions 214 and 655 in 
the spike gene (corresponding to nt 23,848 and 24,289 in the 
genome, respectively) of AH65-E/AH65-R and AH187-E/ 
AH187-R strains, suggesting that determinants for enteric to 
respiratory tropism shifts may be different between the group 
1 TGEV CoV and the group 2 BCoVs. Chouljenko et al. 
(1998) proposed amino acid 769 immediately downstream of 
the cleavage site (764—768) and amino acid 1026 of the spike 
protein as two critical residues responsible for respiratory 
tropism of BCoV. In our BCoV-E and BCoV-R pairs, no 
amino acid difference was found at these two positions, 
suggesting that the amino acid positions 769 and 1026 are not 
relevant to tropism shifts for our strains. Our sequence 
analysis cannot pinpoint specific amino acid residue determi- 
nants for tropism because the enteric strains differed from 
respiratory strains at multiple positions. Analysis of the two 
pairs of viruses did not identify many common genetic 
changes between BCoV-E and BCoV-R strains except for a 
shared C to T mutation at nt 27,947 resulting in aa T to I 
mutation in the 4.8 kDa non-structural protein. This mutation 
was not among the 107 genomic mutations observed between 
the respiratory and enteric BCoV pair LUN/ENT (Chouljenko 
et al., 2001). Further study is needed to verify the exact function 
of this mutation by reverse genetics and animal studies. 
However, it is likely that for BCoV tissue tropism, shifts are 
determined by multiple factors involving a combination of 
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genetic mutations. Others observed a truncated 4.9 kDa non- 
structural protein of 29 aa in BCoV-R strains instead of 43 aa for 
BCoV-E strains and suggested that this protein could be 
involved in determining respiratory tropism (Gelinas et al., 
2001b; Vijgen et al., 2006). But this hypothesis also has not been 
experimentally proven. Our sequence data indicated that all 
respiratory and enteric BCoVs have the truncated 4.9 kDa non- 
structural protein of 29 aa, suggesting that truncation of this 
protein is not necessarily needed for enteric to respiratory 
tropism shift. Involvement of mutations in this protein in tropism 
shift remains to be verified. 

Our analysis has defined the presence of multiple BCoV 
species present in the same animal. We detected evidence for 
intra-host quasispecies (significant sequence variation between 
enteric and respiratory strains) and also intra-isolate quasis- 
pecies (sequence ambiguities within the same strains). The 
comprehensive dissection of BCoV quasispecies will aid in an 
understanding of BCoV population dynamics and pathogen- 
esis. The presence of intra-host and intra-isolate BCoV 
quasispecies suggests that BCoV evolves through selection 
pressures working on a quasispecies basis. Recent evidence has 
suggested that many RNA viruses exist as quasispecies in vivo 
and in vitro (Domingo et al., 1998). Viral quasispecies are 
characterized by continuous genetic variation within virus 
populations which is a result of the high error rates of most 
RNA virus-encoded RNA-dependent RNA polymerases. Qua- 
sispecies evolution can occur in individual hosts, geographi- 
cally related areas, or different temporal phases during the 
course of infection. Examples of RNA viruses with distinctive 
quasispecies include bovine viral diarrhea virus, West Nile 
virus, and hepatitis A virus (Costa-Mattioli et al., 2006; Jerzak 
et al., 2005; Jones et al., 2002). Rapid evolutionary change 
allows viruses to adapt to changing micro or macro environ- 
ments, increase viral fitness, and maintain balance with host 
immune systems in order to achieve long-term survival in 
nature. It is believed that pathogenesis is determined by viral 
quasispecies rather than by the action of a single genotype 
(Vignuzzi et al., 2006). Increasing evidence indicates that 
quasispecies development may lead to the selection of virulent 
viruses and to the emergence of new viral species (Domingo, 
1997; Domingo et al., 2006; Holland et al., 1982). Research 
suggests that SARS-CoV, a newly emerging human CoV, exists 
as quasispecies in individual patients (Xu et al., 2004) and a 
genetically diverse population of SARS-like coronaviruses are 
present in geographically closely related Chinese horseshoe 
bats (Ren et al., 2006). 

Understanding BCoV evolution is important because not 
only is this virus of economical importance to the cattle 
industry, but also this virus shows antigenic and biological 
similarities to other group 2a human CoVs including HCoV- 
OC43 and shares biological, pathogenic properties, and 
pneumoenteric tropisms with group 2b SARS-CoVs. Based 
on sequence analysis, researchers have shown the potential 
zoonotic transmission of a BCoV to human beings (Vijgen et 
al., 2005) and previously confirmed the isolation of a bovine- 
like CoV from a child with diarrhea (Zhang et al., 1994b). 
Moreover the bovine-like, human CoV isolate infected and 


caused disease in gnotobiotic calves (Han et al., 2006). 
Information on BCoVs genomics will aid in our understanding 
of the evolution of both BCoVs and human CoVs including 
HCoV-OC43 and the recently emerged SARS-CoV. 
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